6 research outputs found

    Software: QSAR for Anticancer Agents

    No full text
    CORrelations And Logic (CORAL at http://www.insili co.eu/coral) is freeware aimed at establishing a quantitative structure -property ⁄ activity relationships (QSPR ⁄ QSAR). Simplified molecular input line entry system (SMILES) is used to represent the molecular structure. In fact, symbols in SMILES nomenclatures are indicators of the presence of defined molecular fragments. By means of the calculation with Monte Carlo optimization of the so called correlation weights (contributions) for the above-mentioned molecular fragments, one can define optimal SMILES-based descriptors, which are correlated with an endpoint for the training set. The predictability of these descriptors for an external validation set can be estimated. A collection of SMILES-based models of anticancer activity of 1,4-dihydro-4-oxo-1-(2-thiazolyl)-1,8-naphthyridines for different splits into training and validation set which are calculated with the CORAL are examined and discussed. Good performance has been obtained for three splits: the r 2 ranged between 0.778 and 0.829 for the subtraining set, between 0.828 and 0.933 for the calibration set, and between 0.807 and 0.931 for the validation set. There are a number of systems for the establishing of the quantitative structure -property ⁄ activity relationships (QSPR ⁄ QSAR) based on different collections of molecular descriptors (1-4). The use of large databases, especially databases that are available via the Internet, is typical in modern natural sciences. The majority of the Internet databases oriented on molecular properties are based on the representation of the molecular structure by simplified molecular input line entry system (SMILES) (5-8). Thus, the development of molecular descriptors that are calculated directly from SMILES is an attractive scenario of the QSPR ⁄ QSAR researches. The CORrelations And Logic (CORAL) software is an attempt to develop the standardized SMILES-based optimal descriptors. The aim of the present publication is the demonstration of the ability of the CORAL freeware to be a tool for the QSAR modelling. The numerical data on the anticancer activity for 1,4-dihydro-4-oxo-1-(2-thiazolyl)-1,8-naphthyridines (9) is used to demonstrate this freeware in practice. Most popular 'classic' approach of QSAR modelling can be formulated as the following: (i) definition of a model with compounds of the training set; and (ii) checking of the model with compounds of an external validation set. One can formulate a few questions related to the optimization of this approach. For instance, how the statistical quality of the model will be modified in case of another split into the training and validation sets? How to avoid the overtraining (i.e., how avoid the situation when a good model for the training set becomes a poor model for external substances)? How one can estimate the probability of obtaining a satisfactory and reliable model? Algorithms that are used in the CORAL can give some solutions for the above-mentioned problems from a probabilistic point of view. In fact, CORAL is a producer of random models, which are calculated by the Monte Carlo method. A random model can be a reasonable predictor for an endpoint, if the statistical quality of this model (for both the training and validation set) can be reproduced in a sequence of attempts to build this model. Obeying to this logic, we have examined three different splits in a cascade of attempts to build the models for the anticancer activity. In addition to the above-mentioned classic scheme, one can use the balance of correlations that is available in the CORAL. The basic idea of the balance of correlations is the split of the training set into sub-training and calibration set. The preliminary check of the model is the function of the calibration set. This preliminary check helps to avoid the overtraining. As further step to improve the predictability, the balance of correlations with ideal slopes has been examined. Slopes of the cluster at the plot of experimental versus calculated values of the endpoint on the subtraining and calibration set are ideal if their values are as equivalent as possible. Thus, the discussion of the CORAL as a tool for QSPR ⁄ QSAR analyses is the aim of the present work, considering the specific case study of anticancer activity. Method Data The concentration of the agent to reduce cell viability by 50%, against Murine P388 Leukemia IC 50 (9) is the biological activit
    corecore